(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 




(43) International Publication Date (10) International Publication Number 

23 May 2002 (23.05.2002) PC T WO 02/41301 Al 



(51) International Patent Classification 7 : G10L 19/00, 
H03M 7/30 

(21) International Application Number: PCT/SE01/02510 

(22) International Filing Date: 

13 November 2001 (13.11.2001) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

0004163-2 14 November 2000 (14.1 1.2000) SB 

(71) Applicant (for all designated States except US): CODING 
TECHNOLOGIES SWEDEN AB [SB/SB]; EKJbelns- 
gatan 64, S-113 52 Stockholm (SB). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): KjORLING, 
Krlstofer [SE/SB]; Lostigen 10, S-170 75 Solna (SB). 
EKSTRAND, Per [SE/SE]; Sadermannagatan 45, S-U6 
40 Stockholm (SB). HENN, Fredrik [SE/SB]; Ritarva 4 - 
gen 14, S-168 31 Bromma (SB). VILLEMOES, Lars 
[SE/SE]; Mandolinviigen 22, S-175 56 Jarfalla (SB). 



(74) Agents: ORTENBLAD, Bertll et al; Norens Patentbyra 
AB, Box 10198, S-1G0 55 Stockholm (SB). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DB, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO ( RU, SD, SB, SG, SI, SK, SL. 
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 



(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, C Y, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SB, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GQ, GW, ML, MR, NB, SN, TO, 
TO). 



Published: 

— with international search report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette, 



= (54) Title: ENHANCING PERCEPTUAL PERFORMANCE OF HIGH FREQUENCY RECONSTRUCTION CODING METH- 
= ODS BY ADAPTIVE FILTERING 



Envelope 
data 




Wide band 
signal out 



r5 

(57) Abstract: The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems 
Q utilising high frequency reconstruction (HFR). It utilises adaptive filtering to reduce artifacts due to different tonal characteristics 
^ in different frequency ranges of an audio signal upon which HFR is performed. The present invention is applicable to both speech 
^ coding and natural audio coding systems. 



ENHANCING PERCEPTUAL PERFORMANCE OF HIGH FREQUENCY 
RECONSTRUCTION CODING METHODS BY ADAPTIVE FILTERING 



TECHNICAL FIELD 

.5 The present invention relates to audio source coding systems utilising high frequency reconstruction 
(HFR) such as Spectral Band Replication, SBR [WO 98/57436] or related methods. It improves 
performance of high quality methods (SBR), as well as low quality methods [U.S. Pat. 5,127,054]. It 
is applicable to both speech coding and natural audio coding systems. 

10 BACKGROUND OF THE INVENTION 

In high frequency reconstruction of audio signals, where a highband is extrapolated from a lowband, 
it is important to have means to control the tonal components of the reconstructed highband to a 
greater extent than what can be achieved with a coarse envelope adjustment, as commonly used in 
HFR systems. This is necessary since the tonal components for most audio signals such as voices and 

15 most acoustic instruments, usually are stronger in the low frequency regions (i.e. below 4-5kHz) 

compared to the high frequency regions. An extreme example is a very pronounced harmonic series in 
the lowband and more or less pure noise in the high band. One way to approach this is by adding 
noise adaptively to the reconstructed highband (Adaptive Noise Addition [PCT/SEOO/00159]). 
However, this is sometimes not enough to suppress the tonal character of the lowband, giving the 

20 reconstructed highband a repetitive tc buzzy" sound character. Furthermore, it can be difficult to 

achieve the correct temporal characteristics of the noise. Another problem occurs when two harmonic 
series are mixed, one with high harmonic density (low pitch) and the other with low harmonic density 
(high pitch). If the high-pitched harmonic series dominates over the other in the lowband but not in 
the highband, the HFR causes the harmonics of the high-pitched signal to dominate the highband, 

25 making the reconstructed highband sound "metallic" compared to the original. None of the above- 
described scenarios can be controlled using the envelope adjustment commonly used in HFR systems. 
In some implementations a constant degree of spectral whitening is introduced during the spectral 
envelope adjustment of the HFR signal. This gives satisfactory results when that particular degree of 
spectral whitening is desired, but introduces severe artifacts for signal excerpts that do not benefit 

30 from that particular degree of spectral whitening. 

SUMMARY OF THE INVENTION 

The present invention relates to the problem of "buzziness" and << metallic"-sound that is commonly 
introduced in HFR-methods. It uses a sophisticated detection algorithm on the encoder side to 
35 estimate the preferable amount of spectral whitening to be applied in the decoder. The spectral 

whitening varies over time as well as over frequency, ensuring the best means to control the harmonic 



contents of the replicated highband. The present invention can be carried out in a time-domain 
implementation as well as in a subband filterbank implementation. 

The present invention comprises the following features: 
5 - In the encoder, estimating the tonal character of an original signal for different frequency regions 
at a given time. 

- In the encoder, estimating the required amount of spectral whitening, for different frequency 
regions at a given time, in order to obtain a similar tonal character after HFR in the decoder, given 
the HFR-method used in the decoder, 

10 - Transmitting the information on preferred degree of spectral whitening from the encoder to the 
decoder. 

- hi the decoder, perform spectral whitening in either the time domain or in a subband filterbank, in 
accordance with the information transmitted from the encoder. 

- The adaptive filter used for spectral whitening in the decoder is obtained using linear prediction. 
1 5 - The degree of spectral whitening required is assessed in the encoder by means of prediction. 

- The degree of spectral whitening is controlled by varying the predictor order, or by varying the 
bandwidth expansion factor of the LPC polynomial, or by mixing the filtered signal, to a given 
extent, with the unprocessed counterpart 

- The ability to use a subband filterbank achieving low-order predictors, offers very effective 
20 implementation, especially in a system where a filterbank already is used for envelope 

adjustment. 

- Frequency selective degree of spectral whitening is easily obtained given the novel filterbank 
implementation of the present invention. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will now be described by way of illustrative examples, not limiting the scope or 
spirit of the invention, with reference to the accompanying drawings, in which: 
Fig, 1 illustrates bandwidth expansion of an LPC spectrum; 
Fig. 2 illustrates the absolute spectrum of an original signal at time t Q , and time t x ; 
30 Fig. 3 illustrates the absolute spectrum of the output, at time t Q and time t x , of a prior art copy up 
HFR system without adaptive filtering; 

Fig. 4 illustrates the absolute spectrum of the output, at time / 0 and time t x , of a copy up HFR system 
with adaptive filtering, according to the present invention; 
Fig. 5a illustrates a worst case signal according to the present invention; 
35 Fig. 5b illustrates the autocorrelation for the highband and lowband of the worst case signal; 
Fig. 5c illustrates the tonal to noise ratio q for different frequencies, according to the present 
invention; 

Fig. 6 illustrates a time domain implementation of the adaptive filtering in the decoder, according to 
the present invention; 

40 Fig. 7 illustrates a subband filterbank implementation of the adaptive filtering in the decoder, 
according to the present invention; 



Fig. 8 illustrates an encoder implementation of the present invention; 
Fig. 9 illustrates a decoder implementation of the present invention. 



DESCRIPTION OF PREFERRED EMBODIMENTS 

The below-described embodiments are merely illustrative for the principles of the present invention 
for improvement of high frequency reconstruction systems. It is understood that modifications and 
variations of the arrangements and the details described herein will be apparent to others skilled in the 
art. It is the intent, therefore, to be limited only by the scope of the impending patent claims and not 
by the specific details presented by way of description and explanation of the embodiments herein. 

When adjusting a spectral envelope of a signal to a given spectral envelope a certain amoxmt of 
spectral whitening is always applied. This, since if the transmitted coarse spectral envelope is 
described by H ww ^ (z) and the spectral envelope of the current signal segment is described by 
# envCur (z) , the filter function applied is 

"euvCur \ z ) 



10 



In the present invention the frequency resolution for # cnvRef (z) is not necessarily the same as for 
7/envCur (z) , The invention uses adaptive frequency resolution of H tnvCur (z) for envelope 
adjustment of HFR signals. The signal segment is filtered with the inverse of H cnyCvtT (z) , in order to 
20 spectrally whiten the signal according to Eq. 1. If H eav(>u (z) is obtained using linear prediction, it 
can be described according to 

where 

A{z) = \-f u a k z- k (3) 

25 is the polynomial obtained using the autocorrelation method or the covariance method pigital 

Processing of Speech Signals, Rabiner & Schafer, Prentice Hall, Inc., Bnglewood Cliffs, New Jersey 
07632, ISBN 0-13-213603-1, Chapter 8], and G is the gain. Given this, the degree of spectral 
whitening can be controlled by varying the predictor order, i.e. limiting the order of the polynomial 
A [z) , and thus limiting the amount of fine structure that can be described by /? envCur (z) , or by 

30 applying a bandwidth expansion factor to the polynomial A [z) . The bandwidth expansion is defined 
according to the following; if the bandwidth expansion factor is p , the polynomial A{z) evaluates to 



A{pz) = a Q z t> p°+a { z { p l +a 2 z 2 p 2 +„.+a p z p p p . (4) 



20 



This expands the bandwidth of the formants estimated by # envC ur { z ) according to Fig. 1. The 
inverse filter at a given time is thus, according to the present invention, described as 



H inv (z, p s p) - , (5) 

where p is the predictor order and p is the bandwidth expansion factor. 



The coefficients cc k can, as mentioned above, be obtained in different manners, e.g. the 
autocorrelation method or the covariance method. The gain factor G can be set to one if Hj nv is used 
prior to a regular envelope adjustment. It is common practice to add some sort of relaxation to the 
estimate in order to ensure stability of the system. When using the autocorrelation method this is 
10 easily accomplished by offsetting the zero-lag value of the correlation vector. This is equivalent to 
addition of white noise at a constant level to the signal used to estimate .A (z) . The parameters 
p and p are calculated based on information transmitted from the encoder. 

An alternative to bandwidth expansion is described by; 
15 4(z) = l-$+£^(zj, (6) 

where b is the blending factor. This yields the adaptive filter according to: 



G 



(7) 



Here it is evident that for 6 = 1 Eq. 7 evaluates to Eq. 5 with p = 1 , and ioxb = 0 Eq. 7 evaluates to a 
constant non-frequency selective gain factor. 



The present invention drastically increases the performance of HFR systems, at a very low additional 
bitrate cost, since the information on the degree of whitening to be used in the decoder can be 
transmitted very efficiently. Fig. 2 - 4 displays the performance of a system with the present invention 
compared to a system without, by means of illustrative absolute spectra. In Fig. 2 absolute spectra of 

25 the original signal at time t Q and time t x are displayed. It is evident that the tonal character for the 
lowband and the highband of the signal is similar at time t 0 , while they differ significantly at time t x . 
In Fig. 3 the output at time t 0 and time f, of a system using a copy-up based HFR without the present 
invention are displayed. Here, no spectral whitening is applied giving the correct tonal character at 
time f 0 , but entirely wrong at time t { . This causes very annoying artifacts. Similar results would be 

30 obtained for any constant degree of spectral whitening, albeit the artifacts would have different 

characters and occur at different instances. In Fig. 4 the output at time t 0 and time t x of a system using 



the present invention are displayed. Here it is evident that the amount of spectral whitening varies over 
time, which results in a sound quality far superior to that of a system without the present invention. 



The detector on the encoder side 
5 In the present invention, a detector on the encoder-side is used to assess the best degree of spectral 
whitening (LPC order, bandwidth expansion factor and/or blending factor) to be used in the decoder, 
in order to obtain a highband as similar to the original as possible, given the currently used HFR 
method. Several approaches can be used in order to obtain a proper estimate of the degree of spectral 
whitening to be used in the decoder. In the following description below, it is assumed that the HFR 

10 algorithm does not substantially alter the tonal structure of the lowband spectrum during the 
generation of high frequencies, i.e. the generated highband has the same tonal character as the 
lowband. If such assumptions cannot be made the below detection can be performed using an analysis 
by synthesis, i.e. performing HFR on the original signal in the encoder and do the comparative study 
on the highbands of the two signals, rather than doing a comparative study on the lowband and 

1 5 highband of the original signal. 

One approach uses autocorrelation to estimate the appropriate amount of spectral whitening. The 
detector estimates the autocorrelation functions for the source range (i.e. the frequency range upon 
which the HFR will be based in the decoder) and the target range (i.e. the frequency range to be 

20 reconstructed in the decoder). In Fig 5a. a worst case signal is described, with a harmonic series in 
the lowband and white noise in the highband. The different autocorrelation functions are displayed in 
Fig 5b. Here it is evident that the lowband is highly correlated whilst the highband is not. The 
maximum correlation, for any lag larger than a minimum lag, is obtained for both the highband and 
the lowband. The quotient of the two is used to calculate the optimal degree of spectral whitening to 

25 be applied in the decoder. When implementing the present invention as outlined above, it may be 
preferable to use FFTs for the computation of the correlation. The autocorrelation of a sequence 
x(n)is defined by: 

^W-^jlKkjf), (8) 

where 

30 X(k) = FFT(x(n)). (9) 



Since the objective is to compare the difference of the autocorrelation in the highband and the 
lowband the filtering can be done in the frequency domain. This yields: 



jr^(*)-*(*).j^(*) 

\X Hp {k) = X{k)H Hp {k)' 



(10) 



where (k) and H Hp (k) are the Fourier transforms of the LP and HP filters impulse responses. 



From the above the autocorrelation functions for the lowband and highband can be calculated 
according to: 



r„ Hp {yn) = FFT^X Hp {kf] 



5 



The maximum value, for a lag larger than a minimum lag, for each autocorrelation vector is 
calculated: 



r MaxLp = max (r^ ) V. m > minLag 

/kflx^^max^) V w> minLag 



(i2).' : 



The quota of the two can be used to for instance map to a suitable bandwidth expansion factor. 

The above implies that it would be beneficial to assess a general measurement of the predictability, 
10 i.e. the tonal to noise ratio of a signal in a given frequency band at a given time, in order to obtain a 5 
correct inverse filtering level for a given frequency band at a given time. This can be accomplished 
using the more refined approach below. Here a subband filterbank is assumed, it is well understood 
however that the invention is not limited to such. 

15 A tonal to noise ratio q for each subband of a filter bank can be defined by using linear prediction on 
blocks of subband samples. A large value of q indicates a large amount of tonality, whereas a small 
value of q indicates that the signal is noiselike at the corresponding location in time and frequency. ' 
The q -value can be obtained using both the covariance method and the autocoirelation method. 

20 For the covariance method, the linear prediction coefficients and the prediction error for the subband 
signal block [x(0),x(l) v ..,*(N-l)] can be computed efficiently by using the Cholesky 

decomposition, [Digital Processing of Speech Signals, Rabiner & Schafer, Prentice Hall, Inc., 
Bnglewood Cliffs, New Jersey 07632, ISBN 0-13-213603-1, Chapter 8]. The tonal to noise ratio q is 
then defined by 



25 



E 



(13) 



whereT = |x(0)| +|x(l)| +... + |x(tf-l)f is the energy of the signal block, and E is the energy of 
the prediction error block. 



For the autocorrelation method, a more natural approach is to use the Levinson-Dqrbin algorithm, 
30 [Digital Signal Processing, Principles, Algorithms and Applications, Third Edition, John G. Proakis, 
Dimitris G. Manolakis, Prentice Hall, International Editions, ISBN-Q-13-394338-9, Chapter 1 1] where 



q is then defined according to 




where K t are the reflection coefficients of the corresponding lattice filter structure obtained from the 
prediction polynomial, and p is the predictor order. 

The ratio between highband and lowband values of q is then used to adjust the degree of spectral 
whitening such that the tonal to noise ratio of the reconstructed highband approaches that of the 
original highband. Here it is advantageous to control the degree of whitening utilising the blending* . 
factor b (Eq. 6). ... 

Assuming the tonal to noise ratio q-q H is measured in the highband and q-qi^q^ is measured 
in the lowband, a suitable choice of whitening factor b is given by the formula 

*=1-J^. • ' (15) 

To see this, a first step is to rewrite Eq. 6 in the form 

A b (z) = A(z) + {l-b)(l-A(z)). (16) 

This shows that if the signal used to estimate A (z) is filtered with the filter A b (z) , the predicted 
signal is suppressed by the gain factor 1-6 and the prediction error is unaltered. As the tonal to noise . 
ratio is the ratio of mean squared predicted signal to mean squared prediction error, a value of q prior 
to filtering is changed to (l — b) 2 q by the filtering operation. Applying this to the lowband signal 
produces a signal with tonal to noise ratio (l - b) 2 q L and under the assumption that the applied HFR 
method does not alter tonality, the target value q H in the highband is reached exactly if b is chosen 
according to Eq. 15. 

The values of q based on prediction order p = 2 in each subband of a 64 channel filter bank are 
depicted in Fig, 5c, for the signal of Fig. 5a. Significantly higher values are reached for the harmonic 
part of the signal than for the noisy part. The variability of the estimates in the harmonic part is due to 
the chosen frequency resolution and prediction order. 

Adaptive LPC-based whi tening in the time domain 

The adaptive filtering in the decoder can be done prior to, or after the high-frequency reconstruction. If 
the filtering is performed prior to the HFR, it needs to consider the characteristics of the HFR-method 
used. When a frequency selective adaptive filtering is performed, the system must deduct from what 
lowband region a certain highband region will originate, in order to apply the correct amount of 
spectral whitening to that lowband region, prior to the HFR-unit In the example below, of a time 



domain implementation of the current invention, a non-frequency selective adaptive spectral whitening 
is outlined. It should be obvious to any person skilled in the art that time-domain implementations of 
the present invention is not limited to the implementation described below. 



5 When performing the adaptive filtering in the time domain, linear prediction using the autocorrelation 

method is preferred. The autocorrelation method requires windowing of the input segment used to 

estimate the coefficients a k , which is not the case for the covariance method. The filter used for the 
spectral whitening according to the present invention is 



10 where the gain factor G (in Eq. 5) is set to one. When the adaptive spectral whitening is performed 
prior to the HFR unit, an effective implementation is achieved since the adaptive filter can operate on . 
a lower sampling rate. The lowband signal is windowed and filtered on a suitable time base with the 
predictor order and bandwidth expansion factors given by the encoder, according to Fig. 6, In the 
current implementation of the present invention the signal is low pass filtered 601 and decimated 602. 

15 603 illustrate the adaptive filter. A window 606 is used to select the proper time segment for 

estimation of the A{z) polynomial, 50% overlap is used. TheLPC-routine 607 extracts A(z) given 
the currently preferred LPC-order and bandwidth expansion factor, with a suitable relaxation, A FIR 
filter 608 is used to adaptively filter the signal segment. The spectrally whitened signal segments are 
upsampled 604, 605 and windowed together forming the input signal to the HFR unit. 

20 

Adaptive LPC-based white ning in a subband filter bank 

The adaptive filtering can be performed effectively and robustly by using a filter bank, The linear 
prediction and the filtering are done independently for each of the subband signals produced by the 
filter bank. It is advantageous to use a filterbank where the alias components of the subband signals 

25 are suppressed. This can be achieved by e.g, oversampling the filterbank Artifacts due to aliasing 
emerging from independent modifications of the subband signals, which for example adaptive 
filtering results in, can then be heavily reduced. The spectral whitening of the subband signals is 
obtained through linear prediction analogous to the time domain method described above. If the 
subband signals are complex valued, complex filter coefficients are used for the linear prediction as 

30 well as for the filtering. The order of the linear prediction can be kept very low since the expected 
number of tonal components in each frequency band is very small for a system with a reasonable 
amount of filterbank channels. In order to correspond to the same time base as the time domain LPC, 
the number of subband samples in each block is smaller by a factor equal to the downsampling of the 
filter bank. Given the low filter order and small block sizes the prediction filter coefficients are 

35 preferably obtained using the covariance method. Filter coefficient calculation and spectral whitening 
can be performed on a block by block basis using subband sample time step L , which is smaller than 
the block length K The spectrally whitened blocks should be added together using appropriate 
synthesis windowing. 




(19) 



Feeding a maximally decimated filterbank with an input signal consisting of white gaussian noise will 
produce subband signals with white spectral density. Feeding an oversampled filterbank with white 
noise gives subband signals with coloured spectral density. This is due to the effects of the frequency 
5 responses of the analysis filters. The LPC predictors in the filterbank channels will track the filter 
characteristics in the case of noise-like input signals. This is an unwanted feature, and benefits from 
compensation. A possible solution is pre-filtering of the input signals to the linear predictors. The pre- 
filtering should be an inverse, or an approximation of the inverse, of the analysis filters, in order to 
compensate for the frequency responses of the analysis filters. The whitening filters are fed with the 

10 original subband signals, as described above. Fig. 7 illustrates the whitening process of a subband 
signal. The subband signal'cbrresponding to channel / is fed to the pre-filteringblock 701, and 
subsequently to a delay chain where the depth of the same depends on the filter order 702. The 
delayed signals and their conjugates 703 are fed to the linear prediction block 704, where the 
coefficients are calculated. The coefficients from every L:th calculation are kept by the decimator 

15 705. The subband signals are finally filtered through the filterblock 706, where the predicted 
coefficients are used and updated for every L:th sample. 

Practical implementations 

The present invention can be implemented in both hardware chips and DSPs, for various kinds of 
20 systems, for storage or transmission of signals,, analogue or digital, using arbitrary codecs. Fig. 8 and 
Fig. 9 shows a possible implementation of the present invention. In Fig.8 the encoder side is displayed. 
The analogue input signal is fed to the AID converter 801, and to an arbitrary audio coder, 802, as well 
as the inverse filtering level estimation unit 803, and an envelope extraction unit 804. The coded 
information is multiplexed into a serial bitstream, 80S, and transmitted or stored. In Fig. 9 a typical 
25 decoder implementation is displayed, The serial bitstream is de-multiplexed, 901, and the envelope 
data is decoded, 902, i.e. the spectral envelope of the highband. The de-multiplexed source coded 
signal is decoded using an arbitrary audio decoder, 903; The decoded signal is fed to an arbitrary HFR 
unit, 904, where a highband is regenerated. The highband signal is fed to the spectral whitening unit 
905, which performs the adaptive spectral whitening. Subsequently, the signal is fed to the envelope 
30 adjuster 906. The output from the envelope adjuster is combined with the decoded signal fed through a 
delay, 907. Finally, the digital output is converted back to an analogue waveform 908. 



CLAIMS 

1. A method for enhancement of audio source coding systems using high-frequency reconstruction, 
where said source coding system comprises an encoder representing all operations performed prior to 
storage or transmission, and a decoder representing all operations performed after storage or 
5 transmission, characterised by: 

at said encoder, estimating the tonal character of an original signal at a given time, and 
at said encoder, estimating the required amount of spectral whitening at a given time, in order to 
obtain a similar tonal character after HFR in said decoder, given the HFR-method used in said 
decoder; 

10 transmitting information on said amount of spectral whitening fiom said enooder.to said i ■- • 

decoder; 

at said decoder, adaptively, spectrally whiten a signal prior to High Frequency Reconstruction 
(HFR) or after HFR, according to the spectral whitening information obtained from said encoder. 

15 2. A method according to claim 1, characterised In that said estimation of the tonal character of the 
original signal is done for different frequency regions. * 

3. A method according to claim 1, characterised in that said that said estimation of the required 
amount of spectral whitening is done for different frequency regions. 

20 

4. A method according to claim 1, characterised in that said spectral whitening is performed in the 
time domain, 

5. A method according to claim 1, characterised in that said spectral whitening is performed in a 
25 subband filterbank. 

6. A method according to claim 1, characterised in that said estimation of required amount of spectral 
whitening is done by comparison of the tonal to noise signal ratios q of different subband signals 
obtained from subband filtering of said original signal, where said ratios are obtained using linear 

30 prediction of said subband signals. 

7. A method according to claim 1, characterised in that said estimation of required amount of spectral 
whitening is done by comparison of the tonal to noise signal ratios q of different subband signals 
obtained from subband filtering of said original signal and a HFR signal, where said ratios are 

35 obtained using linear prediction of said subband signals, and said HFR signal is produced in a the same 
manner as said HFR in said decoder. 



40 



8. A method according to claim 1, characterised in that the amount of spectral whitening is controlled 
by the LPC predictor order. 



9. A method according to claim 1, characterised in that the amount of spectral whitening is controlled 
by the bandwidth expansion factor of the LPC polynomial. 

10. A method according to claim 1, characterised in that the amount of spectral whitening is 
5 controlled by the blending factor b. 

11. A method according to claim 5; characterised in that pre-filtering is included in the LPC 
estimation in order to compensate for the characteristic of the filterbank analysis filters. 

10 12. An apparatus for enhancement of .audio source coding systems using high-frequency 

reconstruction, where said source coding system comprises an encoder representing all operations 
performed prior to storage or transmission, and a decoder representing all operations performed after 
storage or transmission, characterised by: 

. at said encoder, means for estimating the tonal character of an original signal at a given time, 

15 and 

at said encoder, means for estimating the required amount of spectral whitening at a given time, 
in order to obtain a similar tonal character after HFR in said decoder, given the HFR-method used in 
said decoder; 

at said decoder, means for, adaptively, spectrally whiten a signal prior to High Frequency 
20 Reconstruction (HFR) or after HFR, according to the spectral whitening information obtained from 
said encoder. 



1/10 



LPC order 40 




1000 



2000 3000 4000 
Frequency [Hz] 



5000 



6000 



Fig.l 



SUBSTITUTE SHEET (RULE 26) 



2/10 



Spectrum of original signal at time tO 
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Spectrum of output using HFR without the present 
invention at time tO 
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Spectrum of output using HFR with the present invention 
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Autocorrelation of the lowband (zero lag = 2048) 
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